Scalable and space-efficient Robust Matroid Center algorithms
نویسندگان
چکیده
Abstract Given a dataset V of points from some metric space, popular robust formulation the k -center clustering problem requires to select (centers) which minimize maximum distance any point its closest center, excluding z most distant (outliers) computation maximum. In this paper, we focus on an important constrained variant problem, namely, Robust Matroid Center (RMC) where set returned centers are be independent matroid rank built . Instantiating with partition yields fair has attracted interest ML community in recent years. target accurate solutions RMC under general matroids, when confronted large inputs. Specifically, devise coreset-based algorithm affording efficient sequential, distributed (MapReduce) and streaming implementations. For fixed $$\varepsilon >0$$ ε > 0 , returns featuring $$(3+\varepsilon )$$ ( 3 + ) -approximation ratio, is mere additive term $$\varepsilon$$ away 3-approximations achievable by best known polynomial-time sequential algorithms. Moreover, obliviously adapts intrinsic complexity dataset, captured doubling dimension D wide ranges $$k,z,\varepsilon D$$ k , z D our MapReduce/streaming implementations require two rounds/one pass substantially sublinear local/working memory. The theoretical results complemented extensive experiments real-world datasets, provide clear evidence accuracy efficiency algorithms their improved performance respect previous solutions.
منابع مشابه
Efficient and Robust Parameter Tuning for Heuristic Algorithms
The main advantage of heuristic or metaheuristic algorithms compared to exact optimization methods is their ability in handling large-scale instances within a reasonable time, albeit at the expense of losing a guarantee for achieving the optimal solution. Therefore, metaheuristic techniques are appropriate choices for solving NP-hard problems to near optimality. Since the parameters of heuristi...
متن کاملIntelligent scalable image watermarking robust against progressive DWT-based compression using genetic algorithms
Image watermarking refers to the process of embedding an authentication message, called watermark, into the host image to uniquely identify the ownership. In this paper a novel, intelligent, scalable, robust wavelet-based watermarking approach is proposed. The proposed approach employs a genetic algorithm to find nearly optimal positions to insert watermark. The embedding positions coded as chr...
متن کاملRobust and Efficient Algorithms
In this paper, we present two new, very efficient and accurate algorithms for computing optical flow. The first is a modified gradient-based regularization method, and the other is an SSD-based regularization method. To amend the errors in the image flow constraint caused b y the discontinuities in the brightness function, we propose to selectively combine the image flow constraint and the cont...
متن کاملEfficient Theoretic and Practical Algorithms for Linear Matroid Intersection Problems
Efficient algorithms for the matroid intersection problem, both cardinality and weighted versions, are presented. The algorithm for weighted intersection works by scaling the weights. The cardinality algorithm is a special case, but takes advantage of greater structure. Efficiency of the algorithms is illustrated by several implementations on linear matroids. Consider a linear matroid with m el...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Big Data
سال: 2023
ISSN: ['2196-1115']
DOI: https://doi.org/10.1186/s40537-023-00717-4